Demo of SemIndex: Semantic-Aware Inverted Index on Text

Authors

  • Joe Tekli
  • Richard Chbeir
  • Yi Luo
  • Marc Al Assad
  • Carlos Raymundo Ibanez
  • Agma J. M. Traina
  • Caetano Traina
  • Kokou Yetongnon
Abstract

Processing keyword-based queries is a central problem in Information Retrieval (IR), and many studies have sought to develop effective keyword-based search techniques [1, 2]. A standard containment keyword-based query, which retrieves textual identities that contain a given set of keywords, is generally supported by a full-text index. The inverted index is considered one of the most useful full-text indexing techniques for large textual collections [3] and is supported by many RDBMSs. It is also increasingly used on semi-structured [2] and unstructured data [1] to support keyword-based queries. Beyond the standard containment keyword-based query, the so-called semantic-aware or knowledge-aware (keyword) query has emerged as a natural extension, encouraged by (non-expert) user demand. In semantic-aware queries, some knowledge needs to be taken into account during query processing. To illustrate, assume we have data from a movie database, as shown in Table 1. Each movie in the database, identified by an id, is described by some text, including the movie title, year, and plot. For the queries “sound of music”, “Maria nun” and “sound Maria”, the query result is movie O3. However, if the user wants to search for a movie but cannot recall the exact title or the exact plot description, it is natural to assume that she may replace the query terms with semantically similar ones, for example, “voice of music”.
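The contrast between a containment query and a semantic-aware query can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions, not the SemIndex structure itself: the movie texts in `MOVIES` are invented placeholders (Table 1 is not reproduced here), the `SYNONYMS` map is a hypothetical stand-in for a knowledge base, and query-time synonym expansion is swapped in as the simplest way to show the idea.

```python
import re
from collections import defaultdict

# Invented placeholder texts; the real Table 1 is not available in this text.
MOVIES = {
    "O1": "Placeholder title one 2001 a detective investigates a case",
    "O2": "Placeholder title two 1995 two friends travel across a desert",
    "O3": "The Sound of Music 1965 Maria a nun becomes a governess",
}

# Hypothetical knowledge source: maps a term to semantically similar terms.
SYNONYMS = {"voice": {"sound"}}


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def build_inverted_index(docs):
    """Map each term to the set of document ids whose text contains it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in tokenize(text):
            index[term].add(doc_id)
    return index


def containment_query(index, query):
    """Return ids of documents that contain every query term exactly."""
    postings = [index.get(term, set()) for term in tokenize(query)]
    if not postings:
        return set()
    return set.intersection(*postings)


def semantic_query(index, query, synonyms):
    """Like containment_query, but each term also matches its synonyms."""
    postings = []
    for term in tokenize(query):
        candidates = {term} | synonyms.get(term, set())
        # A document satisfies this term if it contains any candidate.
        merged = set()
        for candidate in candidates:
            merged |= index.get(candidate, set())
        postings.append(merged)
    if not postings:
        return set()
    return set.intersection(*postings)


index = build_inverted_index(MOVIES)
exact_hits = containment_query(index, "voice of music")
semantic_hits = semantic_query(index, "voice of music", SYNONYMS)
```

With this toy data, the plain containment query for “voice of music” finds nothing because no text contains “voice”, while the expanded query reaches O3 through the assumed synonym “sound”.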

Similar resources

SemIndex: Semantic-Aware Inverted Index

This paper focuses on the important problem of semantic-aware search in textual (structured, semi-structured, NoSQL) databases. This problem has emerged as a required extension of the standard containment keyword-based query to meet user needs in textual databases and IR applications. We provide here a new approach, called SemIndex, that extends the standard inverted index by constructing a tigh...

A Study of the Use of Semantic Technology for Organizing Information in Digital Library Software

This study investigated the use of semantic technologies to organize information in digital library software systems. It was an applied study that employed a descriptive survey method. The study sample consisted of three digital library software systems: Pars Azarakhsh, Parvan Pajoh, and Payam Mashregh. Data were collected through a checklist incl...

Semantic Prosody: Its Knowledge and Appropriate Selection of Equivalents

In translation, choosing an appropriate equivalent is essential to convey the right message from source text to target text, and one of the issues that may play a decisive role in that choice is the semantic prosody (SP) behavior of words and the relation between the SP of a word and the semantic senses (i.e. negativity, positivity or neutrality) of its collocations in ...

Aggregation-Aware Top-k Computation for Full-Text Search

A typical scenario in information retrieval and web search is to index a given type of items (e.g., web pages, images) and provide search functionality for them. In such a scenario, the basic units of indexing and retrieval are the same. Extensive study has been done for efficient top-k computation in such settings. This paper studies top-k processing for many emerging scenarios: efficiently re...

Publication date: 2015